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TEST AND MODEL FOR ALZHEIMER'S DISEASE 

BACKGROUND OF THE INVENTION 
Alzheimer's disease is a progressive disease known generally as 
5 senile dementia. Broadly speaking the disease falls into two categories, namely 
late onset and early onset. Late onset, which occurs in old age (65 + years), may 
be caused by the natural atrophy of the brain occurring at a faster rate and to a 
more severe degree than normal. Early onset Alzheimer's disease is much more 
infrequent but shows a pathologically identical dementia with diffuse brain atrophy 
10 . which develops well before the senile period, Le^, between the ages of 35 and 60 
years. There is evidence that one form of this type of Alzheimer's disease shows 
a tendency to run in families and is therefore known as familial Alzheimer's 
disease (FAD). 

In both types of Alzheimer's disease the pathology is the same but 
15 the abnormalities tend to be more severe and more widespread in cases beginning 
at an earlier age. The disease is characterized by two types of lesions in the brain, 
these are senile plaques and neurofibrillary tangles. 

Senile plaques are areas of disorganized neuropil up to 150/xm 
across with extracellular amyloid deposits at the center. Neurofibrillary tangles 
20 are intracellular deposits of amyloid protein consisting of two filaments twisted 
about each other in pairs. 

The major protein subunit, jS-amyloid protein, of the amyloid 
filaments of the senile plaque is a highly aggregating small polypeptide of 
approximate relative molecular mass 4,500, This protein is a cleavage product of 
25 a much larger precursor protein called amyloid precursor protein (APP). 

At present there is no known effective therapy for the various forms 
of Alzheimer's disease (AD). However, there are several other forms of dementia 
for which treatment is available and which give rise to progressive intellectual 
deterioration closely resembling the dementia associated with Alzheimer's disease. 



wo 92/13069 



PCT/GB92/00123 



2 

A diagnostic test for AD would therefore provide a valuable tool in the diagnosis 
and treatment of these other conditions, by way of being able to exclude 
Alzheimer's disease. It will also be of value when a suitable therapy does become 
available. 

5 Also important is the development of experimental models of 

Alzheimer's disease that can be used to define further the underlying biochemical 
events involved in AD pathogenesis. Such models could presumably be employed, 
in one application, to screen for agents that alter the degenerative course of 
Alzheimer's disease. For example, a model system of Alzheimer's disease could 
10 be used to screen for environmental factors that induce or accelerate the 
pathogenesis of AD, In contradistinction, an experimental model could be used 
to screen for agents that inhibit, prevent, or reverse the progression of AD. 
Presumably, such models could be employed to develop pharmaceuticals that are 
effective in preventing, arresting, or reversing AD. 

15 

SUMMARY OF THE INVENTION 
The present invention provides model systems of Alzheimer's 
disease, wherein the model system comprises a DNA sequence encoding an 
amyloid precursor protein (APP) isoform or fragment that has an amino acid other 
2 0 than valine at the amino acid position corresponding to amino acid residue position 
717of APP770. 

In a first embodiment, the present invention provides an isolated 
DNA sequence that encodes an amyloid precursor protein (APP) isoform or 
fragment that has an amino acid other than valine at the amino acid position 
25 corresponding to amino acid residue position 717 of APP770. 

In a second embodiment, the present invention provides a transgenic 
nonhuman animal that harbors at least one integrated copy of a human DNA 
sequence that encodes an amyloid precursor protein (APP) isoform or fragment 
that has an amino acid other than valine at the amino acid position corresponding 
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to amino acid residue position 717 of APP770. 

In a third embodiment, the present invention provides a transgenic 
nonhuman animal wherein at least one of the endogenous nonhuman AFP alleles 
has been completely or partially replaced by all or a portion of a human APP gene 
5 that includes a codon 717 that does not encode valine. 

In a fourth embodiment, the present invention provides cells, 
typically mammalian cells and preferably mammalian cells of the neural, glial, or 
astrocytic lineage, that have been transformed or transfected with a heterologous 
DNA sequence, or have been derived from a transgenic nonhuman animal, wherein 
10 the cells express an amyloid precursor protein (APP) isoform or fragment that has 
an amino acid other than valine at the amino acid position corresponding to amino 
acid residue position 717 of APP770. In accordance with standard protocols, 
cultured human cells, either primary cultures or immortalized cell lines, may be 
transfected, either transiently or stably, with a mutant APP allele so that the 
15 cultured human cell expresses a mutant APP polypeptide. 

In a fifth embodiment, the present invention provides a method of 
producing transgenic nonhuman animals and transformed cells that contain a DNA 
sequence encoding an amyloid precursor protein (APP) isoform or fragment that 
has an amino acid other than valine at the amino acid position corresponding to 
20 amino acid residue position 717 of APP770. 

In a sixth embodiment, the present invention provides a method of 
producing, free from other human proteins, a human amyloid precursor protein 
(APP) isoform or fragment that has an amino acid other than valine at the amino 
acid position corresponding to amino acid residue position 717 of APP770. 
25 In a seventh embodiment, the present invention provides a human 

amyloid precursor protein (APP) isoform or fragment, free from other human 
proteins, that has an amino acid other than valine at the amino acid position 
corresponding to amino acid residue position 717 of APP770, 
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In an eighth embodiment, the invention provides a method for 
detecting an APP allele that is linked (i.e., cosegregates with) a genetic 
predisposition to Alzheimer's disease, particularly early onset AD, wherein such 
a pathognomonic APP allele is detected by determining that codon 717 of the allele 
does not encode valine. Preferably, a pathognomonic APP allele is detected when 
codon 717 is determined to encode either isoleucine, glycine, or phenylalanine. 
Thus, methods for locating the presence of genetic alterations associated with 
Alzheimer's disease are provided. This diagnostic method may be used to predict 
the development of the disease prior to onset, for genetic screening, or to detect 
a specific mutation in an experimental nonhuman animal or a cell. 

In a ninth embodiment, the invention provides a human variant APP 
polypeptide free of other human proteins, typically present in a cell of a nonhuman 
animal. The invention also relates to an isolated nucleic acid encoding such a 
polypeptide and to uses and applications of such nucleic acid as are described 
above in relation to the specific embodiment of the invention which involves an 
amino acid substitution at position 717 (as defined in relation to APP770), 

According to one aspect of the invention there is provided a method 
for detecting the presence, in a nucleic acid or other sample removed from a 
subject, of the gene for Alzheimer's disease comprising identifying a genetic 
alteration in a gene sequence coding for APP. Such genetic alterations may 
include mutations, insertions or deletions. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 illustrates a first pedigree in which early onset AD is 
apparently inherited as an autosomal dominant disorder. The average age of onset 
in this family is 57±5 years. Black symbols denote affected individuals and 
oblique lines indicate individuals who are deceased. Females are denoted by 
circles and males by squares. Triangles are used in the present generation to 
preserve anonymity. In generation II the spouses of the two affected brothers were 
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sisters. Samples were available from the 13 individuals whose haplotypes are 
illustrated, from a further 19 children and spouses of these individuals and from 
7 more distantly related unaffected individuals. Beneath the pedigree are 
ideograms of the two chromosomes 21 in each individual of the third generation 
5 at four loci on the long arm of the chromosome. The linkage data suggest that the 
black chromosomes were inherited from the affected fathers. 

Fig. 2 shows an autoradiograph of a sequencing gel from part of 
exon 17 of the APP gene in a normal and an affected individual from the Fig. 1 
pedigree showing a single base pair change at base pair 2149 in the affected 
10 individual. This C to T transition leads to an amino acid substitution of a valine 
by an isoleucine at codon 717. 

Fig. 3 shows part of the amino acid sequence encoded by exons 16 
and 17 of the APP gene showing the mutation valine to isoleucine (V to I) within 
the transmembrane domain and the mutation causing HCHWA-D (E to Q) in the 
15 extracellular domain. The shaded region of the transmembrane domain and the 
boxed amino acids of the extracellular domain represent the sequence of the 
deposited jS-amyloid peptide. Adapted from Kang et al. (1987) Nature 325:733. 

Fig. 4 shows BcR digests of the exon 17 PGR product from 
unaffected and affected individuals in an early onset AD family showing 
20 co-segregation of the restriction site and the disease. 

Fig. 5 shows the pedigree of family F19, together with D21S210 

data. 

Fig. 6 shows APP exon 17 sequences in an affected and unaffected 
member of F19. In the affected member there is a G- > T transition at position 2150. 
25 Fig. 7 shows the sequence of APP695. 

Fig. 8 shows the sequence of APP751. 
Fig. 9 shows the sequence of APP770. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The accumulation of /S-amyloid protein (A4) in particular brain 
regions is one of the main pathologic characteristics of Alzheimer's disease. The 
iS-amyloid protein is an approximately 4 kD protein (39 to 42 amino acids) which 
is derived, as an internal cleavage product, from one or more isoforms of a larger 
amyloid precursor protein (APP). There are at least five distinct isoforms of APP 
containing 563, 695, 714, 751, and 770 amino acids, respectively (Wirak et al, 
(1991) Science 253:323). These isoforms of APP are generated by alternative 
splicing of primary transcripts of a single gene, designated the APP gene, which 
is located on human chromosome 21. It is known that most of the APP isoforms 
are glycosylated transmembrane proteins (Goldgaber et al. (1987) Science 
235:877), and that four of the isoforms, AA563, APP714, APP751 and APP770, 
have a protease inhibitor domain that is homologous to the Kunitz type of serine 
protease inhibitors. The jS-amyloid (A4) segment comprises approximately half of 
the transmembrane domain and approximately the first 28 amino acids of the 
extracellular domain of an APP isoform. 

Proteolytic processing of APP in vivo is a normal physiological 
process. Carboxy-terminal truncated forms of APP695, APP751, and APP770 are 
present in brain and cerebrospinal fluid (Palmert et al. (1989) Proc. Natl Acad. 
Sci. U.S.A. 86:6338; Weidemann et al. (1989) Cell 57:115) and result from 
cleavage of the APP isoform at a constitutive cleavage site within the A4 peptide 
domain of an APP isoform (Esch et al. (1990) Science 248:1122). Normal 
proteolytic cleavage at the constitutive cleavage site yields a large (approximately 
100 kD) soluble, N-terminal fragment that contains the protease inhibitor domain 
in some isoforms, and a 9-10 kD membrane-bound, C-terminal fragment that 
includes most of the A4 domain. 

Generation of pathogenic jS-amyloid (A4) protein appears to be the 
result of aberrant or alternative proteolytic processing of APP, such that normal 
cleavage at the constitutive site within the A4 domain does not occur, but rather 
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cleavage occurs at two specific sites which flank the A4 domain. One of these 
aberrant cleavage sites is in the transmembrane domain and the other aberrant 
cleavage site is located approximately at the N-terminus of the first 28 amino acids 
of the extracellular domain (see Fig. 3). Such aberrant proteolytic cleavage 
5 produces the jS-amyloid A4 polypeptide which is prone to forming dense 
amyloidogenic aggregates that are resistant to proteolytic degradation and removal. 
The resultant jS-amyloid aggregates presumably are involved in the formation of 
the abundant amyloid plaques and cerebrovascular amyloid that are the 
neuropathological hallmarks of Alzheimer's disease. However, the exact aberrant 
10 cleavage sites are not always precise; i8-amyloid molecules isolated from the brain 
of a patient with AD show N- and C- terminal heterogeneity. Therefore, the 
aberrant cleavage pathway may involve either sequence-specific proteolysis 
followed by exopeptidase activity (creating end-heterogeneity), or may not be 
sequence-specific, 

15 The APP gene is known to be located on human chromosome 21. 

A locus segregating with familial Alzheimer's disease has been mapped to 
chromosome 21 (Hyslop et al (1987) Science 235:885) close to the APP gene. 
Recombinants between the APP gene and the AD locus have been previously 
reported (Schellenberg et al. (1988) Science 241:1507). The data appeared to 

20 exclude the APP gene as the site of any mutation that might cause FAD (Van 
Broekhoven et al. (1987) Nature 329:153; Tanzi et al. (1987) Nature 329:156). 

Recombinant DNA technology provides several techniques for 
analyzing genes to locate possible mutations. For example, the polymerase chain 
reaction (Bell (1989) Immunology Today, 10:351) may be used to amplify specific 

25 sequences using intronic primers, which can then be analyzed by direct 
sequencing. 

Researchers working in the area of the hereditary cerebral 
haemorrhage with amyloidosis of the Dutch type ("HCHWA-D") (Levy et al 
(1990) Science 248:11224) found a substitution of Glu to Gin at residue 618 (using 
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the APP695 numbering system) in APP which is thought to result in the deposition 
of /S-amyloid in the cerebral vessels of these patients. The present inventors have 
identified a single base substitution, a C to T transition at base pair 2149, has been 
found in part of the sequence of the APP gene in some cases of familial 
Alzheimer's disease. This base pair transition leads to an amino acid substitution, 
i.e., valine to isoleucine at amino acid 717 (APP770) (see Yoshikai et al. (1990) 
Gene 87:257), close to the C-terminus of the /S-amyloid protdn. This suggests that 
some cases of Alzhdmer's disease are caused by mutations in the APP gene, and 
specifically mutations that change codon 717 such that it aicodes an amino add 

other than valine. 

Additionally, a further single base substitution, a T to G transition 
at adjacent base pair 2150, has been found in part of the sequence of the APP gene 
in other cases of familial Alzheimer's disease. This base pair transition leads to 
a different amino acid substitution, namely valine to glydne, at amino add 717, 
ther^y strengthening the argument that some cases of Alzheimer's disease are 
caused by mutations in the APP gene, specifically at codon 717. 

It is now clear that a mutation in the APP gene locus that results in 
a substitution of isoleucine for valine at codon 717 (residue 642 in APP695) gives 
rise to AD in some femilies (Goate et al. (1991) Nature 242:704). A second APP 
allelic variant wherein glycine is substituted for valine at codon 717 is now 
identified, and is so closely linlod to the AD phenotype as to indicate that allelic 
variants at codon 717 of the APP gene, particularly those encoding an amino acid 
other than valine, and more particularly those encoding a isoleucine, glycine, or 
phenylalanine, are pathogenic and/or pathognomonic alleles (Chartier-Harlin et al. 
(1991) Nature 252:844). 

Proteolysis on either side of the /3-amyloid (A4) region of APP may 
be enhanced or qualitatively altered by the specific mutations at codon 717, 
increasing the rate of ^-amyloid deposition and aggregation. Such codon 717 
mutations may increase jS-amyloid formation by providing a poorer substrate for 
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the main proteolytic pathway (cleavage at the constitutive site) or a better substrate 
for a competing, alternative cleavage pathway (at aberrant cleavage sites). 

DEFINITIONS 

5 A number of terms and expressions are used throughout the 

specification and, to facilitate the understanding thereof, the following definitions 
are provided: 

As used herein, "exon" refers to any segment of an interrupted gene 

that is represented in the mature RNA product. 
10 As used herein, "intron" refers to a segment of an interrupted gene 

that is not represented in the mature RNA product. Introns are part of the primary 

nuclear transcript but are spliced out to produce mRNA, which is then transported 

to the cytoplasm. 

As used herein, the phrase "gene sequence coding for amyloid 
15 protein precursor" may be interpreted to mean the DNA and cDNA sequence as 

detailed by Yoshikai a al. (1990) Gene 57:257 and Kang et al, loc. cit. . together 

with the promoter DNA sequence as described by Salbaum et al. (1988) EMBO 

7f9V2807. 

As used herein, the terms "label" or "labeled" refers to 
20 incorporation of a detectable marker (e.g., by incorporation of a radiolabeled 
nucleotide or by end-labeling with a terminal radiolabeled phosphate). DNA or 
RNA is typically labeled by incorporation of a radiolabeled nucleotide (H^, C^"*, 
§35^ p32) or a biotinylated nucleotide that can be detected by marked avidin (e.g., 
avidin containing a fluorescent marker or enzymatic activity) or digoxygeninylated 
25 nucleotide that can be detected by marked specific antibody. 

As used herein, "isoform", "APP", and "APP isoform" refer to a 
polypeptide that is encoded by at least one exon of the APP gene (Kitaguchi et al. 
(1988) liamis 211:530; Ponte et al., ML, p.525; R.E. Tanzi, ML, p.528; de 
Sauvage and Octave (1989) Science 241:651; Golde et al. (1990) Neuron 4:253). 
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An APP isoform may be encoded by an APP allele (or exon thereof) that is 
associated with a form of Alzheimer's disease or that is not associated with an AD 
disease phenotype. 

The term "jS-amyloid gene" is used herein as a synonym for the APP 
5 gene, as j3-amyloid is a protein product produced by a post-translational cleavage 
of an APP gene product 

As used herein, "fragment" refers to a polypeptide of at least about 
9 amino acids, typically 50 to 75, or more, wherein the polypeptide contains an 
amino add core sequence Ousted in order from amino- to carboxy-terminal 

10 direction): 

-ne-Ala-Thr-Val-Ile-X-Ile-Thr-Leu- [SEQ ID N0:6] 
where X is any of the twenty conventional amino acids except valine, and 
particularly where X is isoleudne, glycine, or phenylalanine. A fragment may be 
a truncated APP isoform, modified APP isoform (as by amino acid substitutions, 

15 deletions, or additions outside of the core sequence), or other variant polypq)tide 
sequence, but is not a naturally-occurring APP isoform or jS-amyloid polypeptide 
that is present in a human individual, whether affected by AD or not. If desired, 
the fragment may be fused at either terminus to additional amino acids, which may 
number from 1 to 20, typically 50 to 100, but up to 250 to 500 or more. 

20 As used herem, "APP75r and "APP770" refer, respectively, to the 

751 and 770 amino add residue long polypeptides encoded by the human APP 
gene (Ponte et al. loc. cit .: Kitaguchi et al. loc. cit. : Tanzi et al. loc. cit.). 

As used herein, "codon 717" refers to the codon (i.e., the 
trinucleotide sequence) that encodes the 717th amino acid position in APP770, or 

25 the amino acid position in an APP isoform or fragment that corresponds to the 
717th position in APP770. For example but not limitation, a 670 residue long 
fragment that is produced by truncating APP770 by removing the 100 N-terminal 
amino acids has its 617th amino acid position corresponding to codon 717. In fact, 
as used herein, codon 717 refers to the codon that encodes the 698th amino acid 
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residue of APP751 [SEQ ID N0:2] and the 642nd amino acid residue of APP695 
[SEQroNO:!]. 

As used herein, "human APP isoform or fragment" refers to an APP 
isoform or fragment that contains a sequence of at least 9 consecutive amino acids 
5 that is identical to a sequence in a human APP770, APP751, or APP695 protein 
that occurs naturally in a human individual, and wherein an identical sequence is 
not present in an APP protein that occurs naturally in a nonhuman species. 

A nucleic acid is "operably linked" when it is placed into a 
functional relationship with another nucleic acid sequence. For instance, a 
10 promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the sequence. With respect to transcription regulatory sequences, 
operably linked means that the DNA sequences being linked are contiguous and, 
where necessary to join two protein coding regions, contiguous and in reading 
frame. 

15 The term "corresponds to" is used herein to mean that a sequence 

is homologous (i.e., is identical, not strictly evolutionarily related) to all or a 
portion of a reference sequence. In contradistinction, the term "complementary 
to" is used herein to mean that the complementary sequence is homologous to all 
or a portion of a reference sequence. For illustration, the nucleotide sequence 

20 "TATAC" corresponds to a reference sequence "TATAC" and is complementary 
to a reference sequence "GTATA". 

The term "transcriptional enhancement" is used herein to refer to 
functional property of producing an increase in the rate of transcription of linked 
sequences that contain a functional promoter. 

25 The term "agent" is used herein to denote a chemical compound, a 

mixture of chemical compounds, a biological macromolecule, or an extract made 
from biological materials such as bacteria, plants, fungi, or animal (particularly 
mammalian) cells or tissues. Agents are evaluated for potential biological activity 
by inclusion in screening assays described hereinbelow. 
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As used herein, the term "mutant" refers to APP alleles having 
missense mutations that are pathognomonic for a genetic predisposition for 
developing AD; specifically a mutation at codon 717 (as referenced by the amino 
acid sequence in APP770) of the APP gene, such that codon 717 encodes one of 
5 the nineteen amino acids that are not valine (i.e., glycine, methionine, alanine, 
saine, isoleucine, leucine, threonine, proline, histidine, cysteine, tyrosine, 
phenylalanine, glutamic acid, tryptophan, arginine, aspartic acid, asparagine, 
lysine, and glutamine), but preferably isoleucine, glycine, or phenylalanine. Thus 
a mutant APP770 polypeptide is an APP770 polypeptide that has an amino acid 

10 residue at position 717 that is not valine. Other mutant APP isoforms comprise 
a non-valine amino acid at the amino acid residue position that corresponds to 
codon 717 (i.e., that is encoded by codon 717). Similarly, a mutant APP allele 
or a variant APP codon 717 allele is an APP allele that encodes an amino acid 
other than valine at codon 717 (referenced to the human APP770 deduced 

15 translation as described in the "codon 717" definition, gypra), preferably 
isoleucine, glycine, or phenylalanine. Hence, an APP allele that encodes valine 
at codon 717 is a "wild-type" APP allele. 

It is apparent to one of skill in the art that nucleotide substitutions, 
deletions, and additions may be incorporated into the polynucleotides of the 

2 0 invention. However, such nucleotide substitutions, deletions, and additions should 
not substantially disrupt the ability of the polynucleotide to hybridize to one of the 
polynucleotide sequences shown in Figs. 5 and 6 under hybridization conditions 
that are suffidently stringent to result in specific hybridization. 

"Specific hybridization" is defined herein as the formation of hybrids 

2 5 between a probe polynucleotide (e.g. , a polynucleotide of the invention which may 
include substitutions, deletion, and/or additions) and a specific target 
polynucleotide (e.g., a polynucleotide having the sequence ), wherein the probe 
preferentially hybridizes to the specific target such that, for example, a band 
corresponding to a variant APP allele or restriction fragment thereof, can be 
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identified on a Southern blot, whereas a corresponding wild-type APP allele (i^, 
one that encodes valine at codon 717) is not identified or can be discriminated 
from a variant APP allele on the basis of signal intensity. Hybridization probes 
capable of specific hybridization to detect a single-base mismatch may be designed 
5 according to methods known in the art and described in Maniatis et al.. Molecular 
Cloning: A Uhnratorv Manual . 2nd Ed., (1989), Cold Spring Harbor, N.Y. and 
Berger and Kimmel, Methods in E nzymolo^v, Volume 152. Guide to Molecular 
Cloning Techniques (1987), Academic Press, Inc., San Diego, CA; Gibbs et al. 
(1989) Nucleic Acids Res. 17:2437; Kwok et al. riQQQV Nucleic Acids Res. 

10 1S:999; Miyada et al. (1987) Methods Enzvmol, 154:94, each of which is 
incorporated herein by reference. The Tj^ for oligonucleotides is calculated under 
standard conditions (1 M NaCl) to be [4^C x (G+C) + 2^C x (A+T)]- While 
the conditions of PGR differ from the standard conditions, this T^j is used as a 
guide for the expected relative stabilities of oligonucleotides, Allele-specific 

15 primers are typically 13-15 nucleotides long, sometimes 16-21 nucleotides long, 
or longer; when short primers are used, such as a 14 nucleotide long primer, low 
annealing temperatures are used, typically 44 to SC'C, occasionally somewhat 
higher or lower depending on the base composition of the sequence(s). 

2 0 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Detection of Mutant Codon 717 APP Alleles 

In an embodiment of the invention, the method involves identifying 
a genetic alteration at amino acid 717, which may cause the consensus Val to be 
changed, for example, to another hydrophobic residue. This will generally be 

25 performed on a specimen removed from the subject. Hydrophobic residues include 
Leu, Ala, He and Gly, the first three of which have aliphatic side chains. Phe also 
has a hydrophobic residue which may be appropriate. As indicated above, 
preferred residues include He, Gly, and Phe (Murrell et al, (1991) Science 
254:97). 
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The fact that these mutations discussed above are at the same codon 
may be a coincidence, but this seems unlikely on statistical grounds. There are 
two possibilities that may explain these data. First, substitution of the valine 
residue at codon 717 may result in increased beta-amyloid dqwsition due to 
5 changes in APP metabolism. Secondly, the variation in the sequence around this 
position may result in increased translation of APP mRNAs and thus cause AD by 
a route analogous to that by which AD is believed to be caused in Down Syndrome 
(Tanzi and Hyman (1991) Nature 350:564 and Rumble et al. (1989) N. Engl. J. 
Med. 320: 1446). In situ hybridization studies have shown that APP 717 mutations 

10 do not alter APP expression (Harrison et al. (1991) Neurorep. 2:152). 

The V717I (APP 717 Val->Ile), V717G (APP 717 
Val->Gly) and V717F (APP 717 Val->Phe) mutations would destabilise a 
putative stem loop structure and destroy a possible iron-responsive element 
between base pairs 2131 and 2156 (Tanzi and Hyman, loc. cit.). There are several 

15 other possible mutations which could also disrupt this structure, many of which 
would be silent at the protdn level; yet these mutations specifically referred to 
have involved a change to the same amino acid, and no silent changes or changes 
to other amino acids have been rq)orted prior to the work described herein. 
Examination of sequence data from 10 other mammalian species (Johnstone et al. 

20 (1991) Mol Brain Res. 10:299) shows that while the valine residue at codon 717 
is conserved in all of them, the putative stem loop structure postulated from the 
human sequence (Tanzi and Hyman loc. cit.) would not be predicted to occur in 
either cattle or sheep; and in pig and mouse the consensus sequence for the 
iron-responsive elements is not present. Finally, such stem loop structures are 

25 believed to modulate gene translation by altering mRNA stability (BOausner and 
Harford (1989) Science 246:870); however, Harrison and colleagues (Harrison et 
al. loc. cit.) have shown by in situ hybridization that APP mRNAs are not grossly 
altered in the brain of an individual with the V717I mutation. For these reasons, 
it is believed likely that alterations in the rate of APP translation caused by the 
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specific mutations identified are not likely to be the key to their pathogenicity. 

The fact that the specific mutations discussed involve different 
changes (Val->Ile, Val->Gly, and Val->Phe) suggests that neither side-chain 
hydrophobicity nor side-chain bulk is the crucial issue. All examples of APP 
5 alleles that encode an amino acid other than valine at codon 717, cosegregate with 
FAD; suggesting that the valine that occurs at position 717 in wild-type APP770 
or APP751 is a critical amino acid residue for non-pathogenic APP proteolytic 
processing (i.e., by the constitutive cleavage pathway). 

The major metabolic pathway for the APP molecule involves 

10 cleavage within the beta-amyloid fragment (Esch et aL loc. ciu). To generate 
beta-amyloid, there must be a second pathway in which APP is cleaved outside this 
sequence. Such a cleavage would be likely to leave a stub of the APP molecule 
containing the beta-amyloid fragment embedded in the membrane. Possibly, the 
beta-amyloid-containing fragment which is generated by the second pathway is 

15 degraded by peptidase action; the reported mutations may be pathogenic because 
peptides which contain them may be more resistant to the actions of this peptidase. 
Therefore, genetic alterations in the APP gene which result in altered (generally 
reduced) degradative properties are particularly important in the application of the 
invention. There are several methodologies available from recombinant DNA 

20 technology which may be used for detecting and identifying a genetic mutation 
responsible for Alzheimer's disease. These include direct probing, polymerase 
chain reaction (PGR) methodology, restriction fragment length polymorphism 
(RFLP) analysis and single strand conformational analysis (SSCA). 

Detection of point mutations using direct probing involves the use 

2 5 oligonucleotide probes which may be prepared synthetically or by nick translation. 
The DNA probes may be suitably labelled using, for example, a radiolabel, 
enzyme label, fluorescent label, biotin-avidin label and the like for subsequent 
visualization in for example a Southern blot hybridization procedure. The labelled 
probe is reacted with the sample DNA bound to a nitrocellulose or Nylon 66 
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substrate. The areas that carry DNA sequences complementary to the labelled 
DNA probe become labelled themselves as a consequence of the reannealling 
reaction. The areas of the filter that exhibit such labelling may then be visualized, 
for example, by autoradiography. 
5 Alternative probing techniques, such as ligase chain reaction (LCR) 

involve the use of mismatch probes, Lfi*, probes which have full complementarity 
with the target except at the point of the mutation. The target sequence is then 
allowed to hybridize both with oligonucleotides having full complementarity and 
oligonucleotides containing a mismatch, under conditions which will distinguish 

10 between the two. By manipulating the reaction conditions it is possible to obtain 
hybridization only where there is fiill complementarity. If a mismatch is present 
then there is significantly reduced hybridization. 

The polymerase chain reaction (PGR) is a technique that amplifies 
specific DNA sequences with remarkable efficiency. Repeated cycles of 

15 denaturation, primer annealing and extension carried out with a heat stable enzyme 
Taq polymerase leads to exponential increases in the concentration of desired DNA 
sequences. 

Given a knowledge of the nucleotide sequence encoding the 
precursor of amyloid protein of AD (Kang et al. loc. at., and Yoshikai, above) 

20 it may be possible to prepare synthetic oligonucleotides complementary to 
sequences which flank the DNA of interest. Each oligonucleotide is 
complementary to one of the two strands. The DNA is then denatured at high 
temperatures (e.g., 95 *C) and then reannealed in the presence of a large molar 
excess of oligonucleotides. The oligonucleotides, oriented with their 3' ends 

25 pointing towards each other, hybridize to opposite strands of the target sequence 
and prime enzymatic extension along the nucleic acid template in the presence of 
the four deoxyribonucleotide triphosphates. The end product is then denatured 
again for another cycle. After this three-step cycle has been repeated several times, 
amplification of a DNA segment by more than one million fold can be achieved. 
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The resulting DNA may then be directly sequenced in order to locate any genetic 
alteration. Alternatively, it may be possible to prepare oligonucleotides that will 
only bind to altered DNA, so that PGR will only result in multiplication of the 
DNA if the mutation is present. Following PGR, allele-specific oligonucleotide 
. 5 hybridization (Dihella et al. (1988) Lancet 1:497) may be used to detect the AD 
point mutation. Alternatively an adaptation of PGR called amplification of specific 
alleles (PASA) can be employed; this uses differential amplification for rapid and 
reliable distinction between alleles that differ at a single base pair. 

In yet another method PGR may be followed by restriction 

10 endonuclease digestion with subsequent analysis of the resultant products. The 
substitution of T for G at base pair 2149, found as a result of sequencing exon 17, 
creates a Bell restriction site. The creation of this restriction endonuclease 
recognition site facilitates the detection of the AD mutation using RFLP analysis 
or by detection of the presence or absence of a polymorphic BcU site in a PGR 

15 product that spans codon 717. 

For RFLP analysis, DNA is obtained, for example, from the blood 
of the subject suspected of having AD and from a normal subject is digested with 
the restriction endonuclease Bell and subsequently separated on the basis of size 
using agarose gel electrophoresis. The Southern blot technique can then be used 

20 to detect^ by hybridization with labeled probes, the products of endonuclease 
digestion. The patterns obtained from the Southern blot can then be compared. 
Using such an approach, DNA spanning an Alzheimer's mutation that creates or 
removes a restriction site at codon 717, such as the Bell site, is detected by 
determining the number of bands detected and compa^ng this number to a 

25 reference allele that has a codon 717 allele that encodes valine. 

Correspondingly, the substitution of G for T at base pair 2150 
creates a 5;^NI restriction site (GCATC), which may be exploited in a manner 
similar to that described above, mutmis mutandis. Similar creation of additional 
restriction sites by nucleotide substitutions within codon 717, wherein the codon 
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717 encodes an amino acid other than valine, can be readily calculated by 
reference to the genetic code and a list of nucleotide sequences recognized by 
restriction endonucleases fPromepa Pro tocols and Applications Guide. (1991) 
Promega Corporation, Madison, Wisconsin). 
5 Single strand conformational analysis (SSCA) (Orita et al. (1989) 

Genomics 5:874 and Orita et al. (1990) Genomics 6:271) offers a relatively quick 
method of detecting sequence changes which may be appropriate in at least some 
instances. 

PCR amplification of specific alleles (PASA) is a rapid method of 

10 detecting single-base mutations or polymorphisms (Newton et al. (1989) Nucleic 
Adds Res. 17:2503: Nichols etal. (1989) fiensmies 5:535; Okayamaetal. (1989) 
T. T^h. Clin. Med. 114: 105: Sarkaretal. ri990) Anal. Biochem. 186 :64: Sommer 
etal. n989 mavo Clin. Proc. 64:1361: Wu nQRQWmc. Natl. Acad. Sci. U.S.A. 
M:2757; and Dutton et al. (1991) Biotechnigues 11:700). PASA (also known as 

15 allele specific amplification) involves amplification with two oligonucleotide 
primers such that one is allele-specific. The desired allele is efficiently amplified, 
while the other allele(s) is poorly amplified because it mismatches with a base at 
or near the 3' end of the allele-specific primer. Thus, PASA or the related method 
of PAMSA may be used to specifically amplify one or more variant APP codon 

20 717 alleles. Where such amplification is done on genetic material (or RNA) 
obtained from an individual, it can serve as a method of detecting the presence of 
one or more variant APP codon 717 alleles in an individual. 

Similarly, a method known a ligase chain reaction (LCR) has been 
used to successfully detect a single-base substitution in a hemoglobin allele that 

25 causes sickle cell anemia (Barany et al. (1991) Proc. Natl. Acad. Sci. U.S.A. 
Sfi:189; Weiss (1991) Science 251:1292). LCR probes may be combined, or 
multiplexed for simultaneously screening for multiple different mutations. Thus, 
one method of screwing for variant APP codon 717 alleles is to multiplex at least 
two, and preferably all, LCR probes that will detect an APP allele having a codon 
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717 that does not encode valine, but that does encode an amino acid. The universal 
genetic code provides the degenerate sequences of all the encoded non-valine 
amino acids, thus LCR probe design for detecting any particular variant codon 717 
allele is straightforward, and multiplexed pools of such LCR probes may be 
5 selected in the discretion of a practitioner for his particular desired use. 

In performing diagnosis using any of the above techniques or 
variations thereof, it is preferable that several individuals are examined. These 
may include an unaffected parent, an affected parent, an affected sibling, an 
unaffected sibling as well as other perhaps more distant family members. 

10 

Model Animals and Cell Lines 

Having identified specific mutations in codon 717 of the /S-amyloid 

gene as a cause of familial Alzheimer's disease (FAD), it is possible, using genetic 

manipulation, to develop transgenic model systems and/or whole cell systems 
15 containing the mutated FAD gene (or a portion thereof) for use, for example, as 

model systems for screening for drugs and evaluating drug effectiveness. 

Additionally, such model systems provide a tool for defining the underlying 

biochemistry of APP and i8-amyloid metabolism, which thereby provides a basis 

for rational drug design. 
20 One type of cell system can be naturally derived. For this, blood 

samples from the affected subject must be obtained in order to provide the 

necessary cells which can be permanently transformed into a lymphoblastoid cell 

line using, for example, Epstein-Barr virus. 

Once established, such cell lines can be grown continuously in 
25 suspension culture and may be used for a variety of in vitro experiments to study 

APP expression and processing. 

Since the FAD mutation is dominant, an alternative method for 

constructing a cell line is to engineer genetically a mutated gene, or a portion 

thereof spanning codon 717, into an established (either stably or transiently) cell 
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line of choice. Sisodia (1990) Science 248:492) has described the insertion of a 
normal APP gene, by transfection, into maminalian cells. Oltersdorf et al. ((1990) 
/. Biol Chem. 265:4492) describe the insertion of APP into immortalized 
eulcaryotic cell lines. 

5 Baculovirus expression systems are useful for high level expression 

of heterologous genes in eukaryotic cells. Knops et al. (1991) /. Biol. Chem. 
2(J6(11):7285 describes the expression of APP using such a system. 

Li yet a further use of the present method, it may be possible to 
excise the mutated gene ^.e., a variant APP codon 717 gene) for use in the 

10 creation of transgenic animals containing the mutated gene. For example, an 
entire human variant APP codon 717 allele may be cloned and isolated, either in 
parts or as a whole, in a cloning vector (e.g., XCharon35, cosmid, or yeast 
artifidal chromosome). The human variant APP codon 717 gene, either in parts 
or in whole, may be transferred to a host nonhuman animal, such as a mouse. As 

15 a result of the transfer, the resultant transgenic nonhuman animal will preferably 
express one or more variant APP codon 717 polypeptides. Most preferably, a 
transgenic nonhuman animal of the invention will express one or more variant APP 
codon 717 polypeptides in a neuron-specific manner (Wirak et al. (1991) gMPQ 
1Q:289). This may be accomplished by transferring substantially the entire human 

20 • APP gene (encoding a codon 717 mutant) including the 4.5 kilobase sequence that 
is adjacent to and upstream of the first major APP transcriptional start site. 

Alternatively, one may design minigenes encoding variant APP 
codon 717 polypeptides. Such minigenes may contain a CDNA sequence encoding 
a variant APP codon 717 polypeptide, preferably full-length, a combination of APP 

25 gene exons, or a combination thereof, linked to a downstream polyadenylation 
signal sequence and an upstream promoter (and preferably enhancer). Such a 
minigene construct will, when introduced into an appropriate transgenic host (e.g.,- 
mouse or rat), repress an encoded variant APP codon 717 polypeptide, most 
preferably a variant APP codon 717 polypeptide tiiat contains either an isoleucine, 
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glycine, or phenylalanine residue at codon 717 of APP770 or the corresponding 
position in an APP isoform or fragment. 

One approach to creating transgenic animals is to target a mutation 
to the desired gene by homologous recombination in an embryonic stem (ES) cell 
5 line in vitro followed by microinjection of the modified ES cell line into a host 
blastocyst and subsequent incubation in a foster mother (see Frohman and Martin 
(1989) Cell 145). Alternatively, the technique of microinjection of the mutated 
gene, or a portion thereof, into a one-cell embryo followed by incubation in a 
foster mother can be used. Various uses of transgenic animals, particularly 

1 0 transgenic animals that express a wild-type APP isoform or fragment, are disclosed 
in Wirak et al (1991) EMBO, 10(2):289; Schilling et al (1991) Gene 98(2):225; 
Quon et al (1991) Nature 252:239; Wirak et al (1991) Science 252:323; and 
Kawabata et al (1991) Nature 254:476. Additional methods for producing 
transgenic animals are known in the art. 

15 Alternatively, site-directed mutagenesis and/or gene conversion can 

be used to mutate a murine (or other nonhuman) APP gene allele, either 
endogenous or transfected, such that the mutated allele does not encode valine at 
the codon position in the mouse APP gene that corresponds to codon 717 (of 
APP770) of the human APP gene (such position is readily identified by homology 

20 matching of the murine APP gene or APP protein to the human APP gene or 
APP770 protein). Preferably, such a mutated murine allele would encode 
isoleucine or glycine or phenylalanine at the corresponding codon position. 

Thgr^pgUtig? 

25 Having detected the genetic mutation in the gene sequence coding 

for )3-amyloid protein in an individual not yet showing overt signs of familial AD, 
using the method of the present invention, it may be possible to employ gene 
therapy, in the form of gene implants, to prevent the development of the disease. 

Additional embodiments directed to modulation of the production of 
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variant APP proteins include methods that employ specific antisense 
polynucleotides complementary to all or part of a variant APP sequence, or for 
some embodiments a wild-type APP sequence. Such complementary antisense 
polynucleotides may include nucleotide substitutions, additions, deletions, or 
transpositions, so long as specific hybridization to the relevant target sequence, 
i.e. . a variant APP codon 717 sequence, is retained as a property of the 
polynucleotide. Thus, an antisense polynucleotide must preferentially bind to a 
variant APP (i.e., codon 717 does not encode valine) sequence as compared to a 
wild-type APP (i.e., codon 717 does encode valine). It is evident that the 
antisense polynucleotide must reflect the exact nucleotide sequence of the variant 
allele (or wild-type allele where desired) and not a degenerate sequence. 

Complementary antisense polynudeotides include soluble antisense 
RNA or DNA oligonucleotides which can hybridize specifically to a variant APP 
MRNA species and prevent transcription of the mRNA species and/or translation 
of the encoded polypeptide (Ching et al. (1989) Pmc. Natl. Acad. Sci. U.S.A. 
M:10006; Broder et al. (1990) Ann. Int. Med . m:604; Loreau et al. (1990) 
FEBS Letters 274:53-56); Holcenberg et al. W091/11535; U.S. No. 7,530,165 
("New human CRIPTO gene" - publicly available through Derwent Publications 
Ltd., Rochdale House, 128 Theobalds Road, London, UK); WO91/09865; 
WC)91/04753; WO90/13641; and EP 386563, each of which is incorporated herein 
by reference). The antisense polynucleotides therefore inhibit production of the 
variant APP polypeptides. Antisense polynucleotides may preferentially inhibit 
transcription and/or translation of mRNA corresponding to a variant (or wild-type) 
polypeptides can inhibit T lymphocyte activation. 

Antisense polynucleotides may be produced from a heterologous 
expression cassette in a transfectant cell or transgenic cell or animal, such as a 
transgenic neural, glial, or astrocytic cell, preferably where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et al. loc. 
ciL). Alternatively, the antisense polynucleotides may comprise soluble 
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oligonucleotides that are administered to the external milieu, either in the culture 
medium in vitro or in the circulatory system or interstitial fluid in vivo . Soluble 
antisense polynucleotides present in the external milieu have been shown to gain 
access to the cytoplasm and inhibit translation of specific mRNA species. In some 
5 embodiments the antisense polynucleotides comprise methylphosphonate moieties. 
For general methods relating to antisense polynucleotides, see Antisense RNA and 
DNA . (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY). 

10 Mutant APP Antigens and Monoclonal Antibodies 

In yet anodier aspect of the invention, having detected a genetic 
alteration in a gene sequence coding for APP, it may be possible to obtain samples 
of the altered jS-amyloid protein from the same source. This protein may be 
derived from Uie brain tissue of a subject diagnosed as suffering from Alzheimer's 

15 disease, or more preferably are produced by recombinant DNA methods or are 
syntiiesized by direct chemical synthesis on a solid support. Such polypeptides will 
contain an amino acid sequence of an APP variant allele spanning codon 717. 
Examples of such sequences are: 

(a) -ne-Ala-Thr-Val-Ile-Gly-ne-Thr-Leu- [SEQ ID N0:7] 

20 (b) -ne-Ala-Thr-Val-ne-Met-Ile-Thr-Leu- [SEQIDN0:8] 

(c) -De-Ala-Thr-Val-Ile-Ala-Ile-Thr-Leu- [SEQ ID N0:9] 

(d) -De-Ala-Thr-Val-Ile-Ser-Ile-Thr-Leu- [SEQ ID NO: 10] 

(e) -Be-Ala-Thr-Val-Ile-Ile-Ile-Thr-Leu- [SEQ ID NO: 11] 

(f) -Ile-Ala-Thr-Val-Ile-Leu-Ile-Thr-Leu- [SEQ ID NO: 12] 
25 (g) -lie-AIa-Thr-Val-ne-Thr-De-Thr-Leu- [SEQ ID NO: 13] 

(h) -Ile-Ala-Thr-Val-Ile-Pro-Ile-Thr-Leu- [SEQ ID NO: 14] 

(i) -Ile-Ala-Thr-Val-Ile-His-Ile-Thr-Uu- [SEQ ID NO: 15] 
(j) -Ile-Ala-Thr-Val-Ile-Cys-Ile-Thr-Leu- [SEQ ID NO: 16] 
(k) -Ile-Ala-Thr-Val-Ile-Tyr-Ile-Thr-Leu- [SEQ ID NO: 17] 
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0) -Ile-Ala-Thr-Val-ne-Phe-Ile-Thr-Leu- [SEQ ID NO: 18] 
(m) -Ile-Ala-Thr-Val-Ile-Glu-Ile-Thr-Leu- [SEQ ID NO: 19] 
(n) -ne-Ala-Thr-Val-Ile-Trp-Ile-Thr-Leu- [SEQIDNO:20] 
(o) -ne-Ala-Thr-Val-Ile-Arg-Ile-Thr-Leu- [SEQ ID N0:21] 
(p) -ne-Ala-Thr-Val-Ile-Asp-IIe-Thr-Leu- [SEQ ID NO:22] 
(q) -ne-Ala-Thr-Val-Ile-Asn-IIe-Thr-Leu- [SEQ ID NO:23] 
(r) -ne-Ala-Thr-Val-He-Lys-Ile-Thr-Leu- [SEQ ID NO:24] 
(s) -Ile-Ala-Thr-Val-Ile-Gln-Ile-Thr-Leu- [SEQ ID NO:25] 



Using such polypeptide material it may then be possible to prepare 
antisera and monoclonal antibodies using, for example, the method of Kohler and 
Milstein ((1975) Nature 256:495). Such monoclonal antibodies could then form 
the basis of a diagnostic test. 

Such variant APP polypeptides may be used to immunize an animal 
for the production of specific antibodies. These antibodies may comprise a 
polyclonal antiserum or may comprise a monoclonal antibody produced by 
hybridoma cells. For general methods to prepare antibodies, see ATltibPdigs; A 
X ^ hnrafnrv Manual . (1988) E. Harlow and D. Lane, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, which is incorporated herein by reference. 

For Kcample but not for limitation, a recombinantly produced 
ftagment of a variant APP codon 717 polypeptide can be injected into a mouse 
along with an adjuvant so as to generate an immune response. Murine 
immunoglobulins which bind the recombinant fragment with a binding affinity of 
at least 1 x 10^ M'^ can be harvested from the immunized mouse as an antiserum, 
and may be fiirther purified by affmity chromatography or other means. 
Additionally, spleen cells are harvested from the mouse and fused to myeloma cells 
to produce a bank of antibody-secreting hybridoma cells. The bank of hybridomas 
can be screened for clones that secrete immunoglobulins which bind the 
recombinantly produced fragment with an affinity of at least 1 x 10^ M'^ More 



wo 92/13069 



PCT/GB92/00123 



25 

specifically, immunoglobulins that bind to the variant APP codon 717 polypeptide 
but have limited crossreactivity with a wild-type (i.e., codon 717 encodes valine) 
APP polypeptide are selected, either by preabsoiption with wild-type APP or by 
screening of hybridoma cell lines for specific idiotypes that preferentially bind the 
5 variant as compared to the wild-type. 

The nucleic acid sequences of the present invention capable of 
ultimately expressing the desired variant APP polypeptides can be formed from a 
variety of different polynucleotides (genomic or cDNA, RNA, synthetic 
oligonucleotides, etc.) as well as by a variety of different techniques, 

10 As stated previously, the DNA sequences will be expressed in hosts 

after the sequences have been operably linked to (i.e. , positioned to ensure the 
functioning of) an expression control sequence. These expression vectors are 
typically replicable in Uie host organisms eitiier as episomes or as an integral part 
of the host chromosomal DNA. Commonly, expression vectors will contain 

15 selection markers, e.g., tetracycline resistance or hygromycin resistance, to permit 
detection and/or selection of tiiose cells transformed with the desired DNA 
sequences (sK, U.S. Patent 4,704,362, which is incorporated herein by 
reference). 

Polynucleotides encoding a variant APP codon 717 polypeptide may 
2 0 include sequences that facilitate transcription (expression sequences) and translation 

of the coding sequences, such that the encoded polypeptide product is produced. 

Construction of such polynucleotides is well known in the art and is described 

further in Maniatis et al. Molecular Cloning: A Laboratory Manual . 2nd Ed. 

(1989), Cold Spring Harbor, N.Y. For example, but not for limitation, such 
25 polynucleotides can include a promoter, a transcription termination site 

(polyadenylation site in eukaryotic expression hosts), a ribosome binding site, and, 

optionally, an enhancer for use in eukaryotic expression hosts, and, optionally, 

sequences necessary for replication of a vector. 
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E. £Qli is one prokaryotic host useful particularly for cloning the 
DNA sequences of the present invention. Other microbial hosts suitable for use 
include bacilli, such as Bacillus subtilus . and other enterobacteriaceae, such as 
Salmonella . Serratia . and various Pseudomonas species. In these prokaryotic 
hosts, one can also make expression vectors, which will typically contain 
expression control sequences compatible with the host ceU (s^, an origin of 
repUcation). In addition, any number of a variety of weU-known promoters will 
be present, such as the lactose promoter system, a tryptophan (tip) promoter 
system, a beta-lactamase promoter system, or a promoter system from phage 
lambda. The promoters will typically control expression, optionally with an 
operator sequence, and have ribosome binding site sequences and the like, for 
initiating and completing transcription and translation. 

Other microbes, such as yeast, may also be used for expression. 
Saccharomvces is a preferred host, with suitable vectors having expression control 
sequences, such as promoters, including 3-phosphoglycerate kinase or other 
glycolytic enzymes, and an origin of replication, termination sequences and the like 
as desired. 

In addition to microorganisms, mammalian tissue cell culture may 
also be used to express and produce the polypq)tides of the present invention (sas, 
Winnacker, "From Genes to Clones," VCH PubUshers, N.Y., N.Y. (1987), which 
is incorporated herein by reference). Eukaryotic cells are actually preferred, 
because a number of suitable host cell lines capable of secreting intact human 
proteins have been developed in the art, and include the CHO cell lines, various 
COS cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. Expression 
vectors for these cells can include expression control sequences, such as an origin 
of replication, a promoter, an enhancer (Queen etal. (1986) Tmmynol Rgv. S^'A9, 
which is incorporated herein by reference), and necessary processing information 
sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferred expression control sequences are 
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promoters derived from immunoglobulin genes, SV40, Adenovirus, Bovine 
Papilloma Vims, and the like. The vectors containing the DNA segments of 
interest (e.g., polypeptides encoding a variant AFP polypeptide) can be transferred 
into the host cell by well-known methods, which vary depending on the type of 
5 cellular host. For example, calcium chloride transfection is commonly utilized for 
prokaryotic cells, whereas calcium phosphate treatment or electroporation may be 
used for other cellular hosts. (SfiS, generally . Maniatis, et al. Molecular Cloning: 
A Laboratory Manual . Cold Spring Harbor Press, (1982), which is incorporated 
herein by reference.) 

10 Alternatively, homologous recombination may be used to insert an 

AFP mutant sequence into a host genome at a specific site, for example, at a host 
AFP locus. In one type of homologous recombination, one or more host 
sequence(s) are replaced; for example, a host AFP allele (or portion thereof) is 
replaced with a mutant AFP allele (or portion thereof). In addition to such gene 

15 replacement methods, homologous recombination may be used to target a mutant 
APP allele to a specific site other than a host AFP locus. Homologous 
recombination may be used to produce transgenic non-human animals and/or cells 
that incorporate mutant AFP alleles. 

The method lends itself readily to the formulation of test kits which 

20 can be utilized in diagnosis. Such a kit would comprise a carrier being 
compartmentalized to receive in close confinement one or more containers wherein 
a first container may contain suitably labelled DNA probes. Other containers may 
contain reagents useful in the localization of the labelled probes, such as enzyme 
substrates. Still other containers may contain a restriction enzyme (such as BcH), 

25 buffers and the like, together with instructions for use. 



EXPERIMENTAL EXAMPLES 
The following examples are provided for illustration and are not 
intended to limit the invention to the specific example provided. 
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FYAMPT,E 1 - Detection of a Val->Ile mutarion in the fl-amvloid rAPP> gene 

The segregation of AD and markers along the long arm of 
chromosome 21 in a single family with autopsy-confirmed Alzhdma-'s disease (see 
Fig. 1) were examined. DNA samples were available ftom a total of six affected 
5 and 33 unaffected and at risk individuals. 

The APP gene in an affected family member was analyzed by 
polymerase chain reaction (PGR) direct sequencing using intronic primws 
(Gyllensten, U. in PGR Technology, Ed. Erlich, H.A., Stockton Press, 45-60, 
1989; Yoshikai et al. (1990) Gene 87:257). (Fig. 2). The primers were made 
10 according to the manufacturer's protocol using a Gene Assembler Plus (Pharmacia 
LBCB). 

PGR was carried out using the following intronic primers in order 
to amplify exon 17 of the APP gene: 

15 [A] 5'-CCTCATGCAAATGTCCGGGTCATr-3' [SEQ ID NO:26] AND 

[B] 5'-GGCrAATrCrCTCATAGTCrrAATrCGCAG-3' [SEQ ID NO 27] 

PGR conditions were 94''G for lOmin to denature; thai 35 cycles of eO'G for 
Imin, 72"G for 3min, 94'G for 1.5min; and a single cycle of 72"C for lOmin. 

20 The reaction was carried out using lOmM tris-HGl pH 8.3, 50mM potassium 
chloride, 0.01% gelatin, 1.5mM magnesium chloride, 200^M of dNTPs, 50 
pmoles of each PGR primer and 1 unit of Taq polymerase. The total final reaction 
volume was 25;tl. 

A second PGR reaction was then performed with a final 

25 concentration of 50 pmol of primer [A] and 0.5 pmol of primer [B]. The PGR 
product was purified on a centricon 100 microconcentrator (Amicon) and used 
directly for sequencing with the SEQUENASE kit (version 2.0, United States 
Biochemical Gorp.; the word SEQUENASE is a trade mark) following the 
manufacturer's protocol. 
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Exon 17 was sequenced first because it encodes part of the 
i3-amyloid peptide and is the site of the mutation (at APP693) leading to Hereditary 
Cerebral Haemorrhage with Amyloidosis-Dutch Type (HCHWA-D). 

Sequencing of exon 17 revealed a C to T transition at base pair 
5 2149, causing a valine to isoleucine change at amino acid 717 (Fig. 2 and Fig. 3)- 

This C to T transition creates a BcH restriction site enabling 
detection within the PGR product (Fig. 4). Bell digests were carried out at SO'^C 
for 2-4 hours, as recommended by the manufacturer, then electrophoresed in 3% 
agarose. 

10 Screening by PGR of 100 unrelated, normal individuals and 14 cases 

(9 families) of familial late onset disease failed to demonstrate this substitution. 
Screening of 11 (9 families) cases of early onset familial disease revealed the BcH 
restriction site in two affected individuals from an unrelated family. The genetic 
data show that the disease loci are linked to the missense mutation. Also, failure 

15 to detect this polymorphism in 200 normal chromosomes supports the contention 
that it is a pathogenic mutation. 

The valine to isoleucine substitution occurs within the 
transmembrane domain two residues from the G-terminus of the /S-amyloid peptide. 
Gomputer analysis predicts that the substitution makes the transmembrane more 

20 hydrophobic and might thus anchor the protein more firmly within the membrane. 
The position of the substitution, two residues from the G-terminus of the i5-amyloid 
peptide may be of significance to the origin of the deposited peptide. This finding 
links Alzheimer's disease to HGHWA-D, a disease in which amyloid deposition 
is due to a mutation closer to the N-terminus but withir the /3-amyloid peptide 

25 (Levy etaL loc. cit.). 
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FXAMPLE 1 - Preparation of a cell line containing a defective fl-amvloid (APP) 
gene 

10ml of fresh blood are collected from each individual suffering 
from familial Alzheimer's disease. Lymphocytes are purified from the blood on a 
Percoll gradient and mixed with Epstein-Barr vims (EBV). The cells are then 
plated out in medium supplemented with 10% foetal calf serum, antibiotics, 
glutamine and Cyclosporin A to kill the T lymphocytes. B lymphocytes which are 
infected by EBV become immortalized and establish a permanent cell line derived 
from the B cells of the patient. 

A lymphoblastoid cell line, AC21, has been deposited with the 
European Collection of Animal Cell Cultures, Porton Down, 



P.YAMPLE 3 - Detection of a Val->G lv mutation in the fl-amvloid fAPP) eene 
A pedigree, designated F19 and shown in Fig. 5, which has 
autopsy-confirmed AD with an onset age of 59±4 years was identified by 
observing that an allele of the highly polymorphic dinucleotide repeat marker 
GT12 (D21S210), which is located close to the APP gene, co-segregated with the 
disease. Linkage analysis gave a peak lod score betweai the marter and the 
disease of 3.02 at a recombination fraction of zero, as the following table shows: 
Theta 0 0.01 0.05 0.1 0.2 0.3 0.4 
Lod 3.02 2.97 2.75 2.47 1.86 1.22 0.6 

Lod scores were calculated with seven liability classes modelling 
age-dependent penetrances from 0.01 to 0.95 with a phenocopy rate of 0.001 and 
a gene frequency of 0.001 using MLINK from the LINKAGE package (Lathrop 
et al. (1984) Proc. Natl Acad. Sci. USA 81:3443). 

APP exon 17 sequences in an affected and an unaffected member of 
F19 were determined. In the affected member, there was a G->T transition at 
position 2150, as can be seen from Fig. 6. 
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The amplification of exon 17 was performed as described in 
Example 1 above and Chartier-Harlin et al. (1991) Neurosci. Letts. 129:134, with 
the following modifications: (a) the amplification primer sequences were: 

ATA-ACC-TCA-TCC-AAA-TGT-CCC-C [SEQ ID NO:28] and 
5 GTA-ACC-CAA-GCA-TCA-TGG-AAG-C [SEQ ID NO:29]; and 

(b) the PGR conditions were 94*C/10 minutes then 35 cycles of 60°C/1 minute, 
72''C/1 minute, 94'C/l minute, followed by 72*'C/5 minutes. 

SO pmol of the second primer were used to generate single stranded 
product, which was then purified (Chartier-Harlin et al loc. cit.). The purified 
10 product was sequenced with the SEQUENASE kit (2.0) (Trade mark; USB) using 
a primer of sequence: 

AAA-TGA-AAT-TCT-TCT-AAT-TGC-G [SEQ ID NO:30]. 

The presence of the T->C transition creates gel artefacts which 
were resolved by the inclusion of inosine (SEQUENASE kit) in the sequencing 
15 reaction. 

Direct sequencing of exons 7 and 16 from affected individuals from F19 (Chartier- 
Harlin et al. he. cit.) shows that these were of normal sequence and SSCA (Orita 
et al. loc. cit.) and Orita et al.) failed to identify changes in exons 2, 3, 7, 9, 12, 
20 13 or 15. SSCA of exon 17 detects both APP693 (Levy, et al. loc. cit. and Hardy 
et al. (1991) Lancet 337:1342-1343) and APP717 Val->ne under standard 
screening conditions and, when modified APP717 Val->Gly. 

EXAMPLE 4 - Productio n of Transgenic Animals with Mutant APP Allele 
25 Generation of the constructs : The vector plink was constructed by 

cloning polylinker between the PvuII and EcoRI sites of pBR322 such that the 
Hindin end of the polylinker was adjacent to the PvuII site. The ligation 
destroyed botii tiie EcoRl and pvull sites associated with the pBR322 segments. 
The 700bp Hpal to EcoRl fragment of pSV2neo (Southern and Berg (1982) L 
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Mnl. Apnl. Genet. 1:327) that contains the SV40 polyadenylation signal was 
cloned into the Hpal to EcoRl sites of plink to generate pNotSV. The 200 bp 
Xhol to PstI fragment of pL2 containing the SV40 16S/lgS splice site (Okayama 
and Berg (1983) Mol. Cell Biol. 2:280) was isolated, blunted with Klenow, then 
5 cloned into the Hpal site of pNotSV to generate pSpIice. The 2.3kb Nrul to Spel 
fragment of pAPP695 containing the coding region of the cDNA for APP (Tanzi 
et al. (1987) Science 235:880) was cloned into the Nrul to Spd site of pSplice to 
graieiate pd695. The same strategy was used to generate pd751 using the cDNA 
for APP751 (Tanzi et al. (1988) Nature 331:528). A variety of promoters have 
10 been inserted into the pd695 and pd751 vectors by using die unique Nrul or the 
Hindm and Nrul sites. 

Generation of pshAPP695 & pshAPP751 : The construct 
pAmyproBam was generated by cloning the 1.5kb BamHI fragment of the APP 
15 cDNA into the BamHI site of pucl9 xHamy. The 700 bp Hindm to Asp718 
fragment of the pAmyproBam (similar to the 700 bp BamHI to Asp718 fragment 
described in Salbaum et al. (1988) EMBO 7:2807) was cloned into the HindlH to 
Asp718 sites of pd695 and pd751 to yield pshAPP695 and pshAPP751. 

2 0 pAPP695.and pAPP751 : The pAPP695 and pAPP751 vectors were 

generated by a three-way ligation of the 3.0 kb EcoRI to Xhol fragment of 
pAmyProBam, the 1.5kb Xhol to Spel fragment of APP751cDNA, and the Spel 
to EcoRI site of pd751. 

25 Generation of pNSE75ir+47) : The pNSE751 (+47) was 

constructed using a three-way ligation of the Hindlll to Kpnl fragment of pNSE6 
(Forss-Petter et al. (1990) Neuron 5: 187). The Kpnl to BstYl fragment of pNSE6 
and a partial BamHI (-47nt relative to tiie ATG) to Hindlll fragment of pAPP751. 
This resulted in the generation of a Kpnl fragment that was cloned into the Kpnl 
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sites of pNSE751(+47). The BstYl/Bam fusion results in the loss of both sites. 

Generation of pNSE751 : This vector was generated using a four 
primer two-step PGR protocol (Higuchi et al (1988) Nucl Acids Res. 16:7351) 
5 that resulted in a direct fusion of the NSE initiation codon to the APP coding 
region. Oligonucleotides C2, 1072, 1073, and A2 (see Nucleotide Sequences, 
infra.) were used to generate a PGR product. The Kpnl fragment was generated 
by digestion with the restriction enzyme. The Kpnl fragment was used to replace 
a similar fragment in pNSE751(+47), 

10 

Generation of pNSE751-Hardv and pNSE751-Dutch : The Hardy 
(APP642 Val->Ile of APP695) and Dutch (APP618 Gln->Glu of APP695) 
mutations were introduced using a four primer two-PGR protocol. Both sets of 
reactions used the same "outside primers" with the "inside primers" containing the 
15 appropriate mutations. This resulted in the generation of Bgin to Spel fragment 
after digestion, that contained either the Dutch or the Hardy mutation. The Bgin 
to Spel fragment of pNSE751 was replaced by the mutated fragment to generate 
the appropriate vector. The presence of the mutation was conformed by sequence 
analysis of the vectors. 

20 

Generation of pNSE751-Hardv and pNSE751 -Dutch : The Hardy VI 
(APP642 V to I), Hardy VG (APP642 V to G), and Dutch (APP618 E to Q) 
mutations were introduced using the four primer two-step PGR protocol (Higuchi 
et al (1988)). The Hardy VI mutant was generated using primers 117/738, 922, 
25 923, and 785; Hardy VG mutant was generated using primers 117/738, 1105, 
1106, and 785; Dutch mutant was generated using primers 117/738, 1010, 1011, 
and 785. In all these mutations the 700 bp Bglll to Spel fragment was isolated by 
digestion of the PGR product with the restriction enzymes, then cloned into the 
same sites of pNSE751, The mutations were confirmed by sequence analysis. 
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npnffratinn ofnNFH751 : The human NFH gene (Lees et al. (1988) 
EMBO 7(7): 1947) was isolated from a genomic library using a rat NFH cDNA as 
a probe (Lieberburg et al. (1989) Proc. Natl. Acids. Res. USA 86:2463). An SstI 
ftagment was subcloned into the pSK vector. A pair of PGR primers was 
5 generated to place a Nrul site at the 3' end of the 150 bp amplified fragment 
immediately upstream of the initiation codon of the NFH gene. The 5' end 
contains a Kpnl site 150nt upstream of the initiation codon. The final construction 
of pNFH751 was generated by a three-way ligation of the 5.5b Hindm to Kpnl 
ftagment of pNFH8.8, the Kpnl to Nrul PGR generated ftagment, and the Hindm 

10 to Nrul ftagment of pd751. The sequence surrounding the PGR generated fusion 
at the initiation codon was confirmed by sequence analysis. The Dutch and Hardy 
variants of pNFH751 were generated by substitution of the 600 bp Bgin to Spel 
ftagment from a sequence confirmed mutated vector for the same fragment of 
pNFH751. The presence of the mutation was confirmed by hybridization with the 

15 mutated oligomer or by sequence analysis. 

Generation of pThv751 : The pThy751 vector was generated by a 
three-way Ugation. The Hindm to BamHI fragment of pThy8.2 which was 
isolated ftom a human genomic library (Ghang etal. (1985) Proc. Natl. Acad. Sci. 
2 0 USA 82:3819), the synthetic ftagment ThyAPP, and the Hindm to Nrul fiagment 
ofpd751. 
ThyAPP: 

GAGAGTGAGATGGGAGAAGGGTAGGTGTGAGTGTAGGGTGTTGGESEQ 
ID N0:31] 

25 

Generation of pThvClOO : This pThyGlOO construct was generated 
by a three-way ligation. The 3.6kb Hindlll to BamHI ftagment of pThy8.2, the 
synthetic fragment ThyAPP2, and the HindUI to Bglll fragment of pd751 or 
pNSE751 Dutch or pNSE751 Hardy were ligated to yield pThyClOO. 
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ThyAPP2: 

CAGACTGAGATCCCAGAACCGATCCTAGGTCTGACTCTAGGGTCTTGG 
[SEQ ID NO:32] 

The region around the initiation codon was confirmed by sequence 

5 analysis. 

Preparation of DNA for injection : The transgene for injection was 
isolated from the corresponding vector of interest for digestion with NotI and gel 
electrophoresis. The transgene was purified by using the Gene Clean kit (BiolOl), 
10 then further purified on an Elutip or HPLC purified on a Nucleogen 4000 column. 

Microinjection : The transgene was injected at 2-20 mcg/ml into the 
most convenient pronucleus (usually the male pronucleus) of FVB or B6D2F2 one- 
cell embryos (Manipulating the Mouse Embryo, B. Hogan, F. Constantini, E. 

15 Lacy, Cold Spring Harbor . 1986), The injected embryos were cultured overnight. 

Embryos that split to the two-cell stage were implanted into pseudo-pregnant 
female CDl mice. The mice were weaned at approximately 21 days. Samples of 
DNA obtained from tail biopsy were analyzed by Southern blot using a transgene 
specific probe (usually the SV40 3's splice and polyadenylation signal sequences). 

20 Transgenic mice harboring at least one copy of the transgene were identified. 

Use of Transgenic Mice: A mouse that expresses the hAPP gene 
or its variants can be used to test the pathogenesis of amyloid deposition and 
therapeutic intervention designed to modulate amyloid deposition. 

Biochemical analysis of the transgenic mice reveals possible 

25 intermediates in die catabolism of APP that are likely precursors to beta-amyloid. 
This analysis can be carried out in the animal or in primary tissue culture of the 
expressing cells. 

The animal can be used to test potential tiierapeutic agents. The test 
group of mice is treated with the test compound administered in an appropriate 
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fashion for a set period. At the conclusion of the test period, the animals are 
assessed behaviourally, biochemically, and histologically for any possible effects 
of the test compound. The exact protocol depends on the anticipated mechanism 
of action of the test compound. Compounds that may have utility in treating AD 
5 can be identified using this approach. 
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SEQUENCE LISTING 



1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Imperial College of Science, Technology & 

Medicine (not US) 

(B) STREET: Sherfield Building, Exhibition Road, 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : SW7 2AZ 

(A) NAME: HARDY, John Anthony (US only) 

(B) STREET: 187 DraJcefell Road 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : SE4 

(A) NAME: GOATE, Alison Mary (US only) 

(B) STREET: 100 High Street, Hampton Wick, 

(C) CITY: Kingston-on-Thames 

(D) STATE: Surrey 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : KTl 4DQ 

(A) NAME: MULLAN, Michael John (US only) 

(B) STREET: Suncoast Gerontology Ctr, 12901 Bruce B. 

Downs Blvd. MDC 50, 

(C) CITY: Tampa 

(D) STATE: Florida 

(E) COUNTRY: US 

(F) POSTAL CODE (ZIP): 33612 

(A) NAME: CHARTIER-HARLIN , Marie-Christine (US only) 

(B) STREET: 63 Francis Road 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : ElO 6PN 

(A) NAME: OWEN, Michael John (US only) 

(B) STREET: Four Hedges, Castlehill, LLanblethian, 

(C) CITY: Cowbridge 

(D) STATE: South Glamorgan 

(E) COUNTRY: GB 

(ii) TITLE OF INVENTION: Test and Model for Alzheimer's Disease 

(iii) NUMBER OF SEQUENCES: 44 

(iv) COMPUTER READABLE FORM: 
Not Applicable 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB92/ 
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INFOPMATIOM FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr LYS Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu . 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 . 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 

325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 

405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 
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Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Glv Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 

645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 
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) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val Val GlU Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 
290 295 300 

ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 



315 



305 310 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 



325 



330 



CVS Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
^ 340 345 350 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu 
355 360 365 

His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg 
370 375 380 

Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gin 
' 395 400 



385 



390 



Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He Gin His Phe 

405 410 415 

Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn Glu Arg Gin 
420 425 430 

Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu Asn Asp 
435 440 445 

Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu Gin Ala Val 
450 455 460 

Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg 
465 470 475 480 

Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe Glu His Val 

485 490 495 

Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser Gin Val Met 
500 505 510 

Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser Leu Ser Leu 
515 520 525 
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Leu Tyr Asn Val Pro Ala Val Ala Glu Glu lie Gin Asp Glu Val Asp 
530 535 540 

Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val Leu Ala Asn 
545 550 555 560 

Met lie Ser Glu Pro Arg lie Ser Tyr Gly Asn Asp Ala Leu Met Pro 

565 570 575 

Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly 
580 585 590 

Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe Gly Ala Asp 
595 600 605 

Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg 
610 615 620 

Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr 
625 630 635 640 

Asn lie Lys Thr Glu Glu lie Ser Glu Val Lys Met Asp Ala Glu Phe 

645 650 655 

Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe 
660 665 670 

Ala Glu Asp Val Gly Ser Asn Lys Gly Ala lie lie Gly Leu Met Val 
675 680 685 

Gly Gly Val Val He Ala Thr Val He Val He Thr Leu Val Met Leu 
690 695 700 

Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val Glu Val Asp 
705 710 715 720 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 

725 730 735 



Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
740 745 750 
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INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 770 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 

325 330 335 

Cys Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 

405 410 415 

Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val lie 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met 
450 455 460 

Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr lie Thr Ala Leu 
465 470 475 480 

Gin Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 

485 490 495 

Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
500 505 510 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin lie Arg Ser 
515 520 525 
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Gin Val Met Thr His Leu Arg Val lie Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 

565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 

645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 
675 680 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 
705 710 715 720 

Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val 

725 730 735 

Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met 
740 745 750 

Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 
755 760 765 



Gin Asn 
770 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



AT6CTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


GCTG6AGGTA 


60 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


TTGCCATGTT 


CTGTGGCAGA 


120 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


180 


ACCTGCATTG 


ATACCAAGGA 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


240 


CAGATCACCA 


AT6TGGTAGA 


A6CCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAA6CGG 


300 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


CTTAGTTGGT 


360 


GAGTTTGTAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


AATTCTTACA 


CCAGGAGAGG 


420 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


480 


AA6AGTACCA 


ACTTGCATGA 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATT6A 


CAAGTTCCGA 


540 


GGGGTAGAGT 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


600 


GCG6AGGAGG 


ATGACTCGGA 


T6TCT6GTG6 


GGCGGAGCAG 


Aa\CAGACTA 


TGCA6ATGGG 


.660 


AGTGAA6ACA 


AAGTA6TAGA A6TAGCAGAG 


GAGGAAGAAG 


TGGCTGAGGT 


GGAA6AAGAA 


720 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


780 


CCCTACGAAG 


AAGCCACAGA 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


840 


GAGTCTGTGG 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


900 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


GAAAGCCAAA 


960 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


TCATGAGAGA 


ATGGGAAGAG 


1020 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


1080 


CAGGAGAAAG 


TGGAATCTTT 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


1140 


ACACACATGG 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1200 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


TATGCTAAAG 


1260 
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AAGTATGTCC GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


TAAAGCATTT 


CGAGCATGTG 


1320 


rnrXTGGTGG ATCCCAAGAA 


AGCCGCTCAG 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


1380 


rsTOATTTATG AGCGCATG7LA 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


1440 


GAGGAGATTC AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1500 


6TCTTGGCCA ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


TCTCATGCCA 


1560 


TCTTTGACCG AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


TGAATGGAGA 


GTTCAGCCTG 


1620 


GACGATCTCC AGCCGTGGCA 


TTCTTTTGGG 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


1680 


GAAGTTGAGC CTGTTGATGC 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


1740 


TCTGGGTTGA CAAATATCAA 


GAC6GAGGAG 


ATCTCTGIU^G 


TGAAGATGGA 


TGCAGAATTC 


1800 


CGACATGACT CAGGATAIGA 




CAAAAATTGG 


TGTTCTTTGC 


AGAAGATGTG 


1860 


GGTTCaAACA AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


GTGTTGTCAT 


AGCGACAGTG 


1920 


ATCGTCATCA CCTTGGTGAT 


GCTGAAGAAG 


AAACAGTACA 


CATCCATTCA 


TCATGGTGTG 


1980 


GTGGAGGTTG ACGCCGCTGT 


CACCCCAGAG 


GAGCGCCACC 


TGTCCAAGAT 


GCAGCAGAAC 


2040 


GGCTACGAAA ATCCAACCTA 


CAAGTTCTTT 


GAGCAGATGC 


AGAACTAG 




2088 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


GCTGGAGGTA 


60 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


TTGCCATGTT 


CTGTGGCAGA 


120 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


180 


ACCTGCATTG 


ATACCAAGGA 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


0^0\0%mf% •» # uii^ 

CCCTGAACTG 


*> VI n 


CAGATCACCA 


AT6TGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACT6 


GTGCAAGCGG 




GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


CTTAGTTGGT 


^ c n 


GAGTTTGTAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


AATTCTTACA 


0%0%^ ^ ^ K 
CCAGGAGAGG 




ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


A Q n 


AAGAGTACCA 


ACTTGCATGA 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGU X CUijA 


dh u 


6G6GTAGAGT 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 




Ann 


GCGGAG6AGG 


AT6ACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


TGCAGATGGG 


660 


AGTGAAGACA 


AA6TAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


TGGCTGAGGT 


GGAAGAAGAA 


720 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


6GTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


780 


CCCTACGAAG 


AAGCCACAGA 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


840 


GAGTCTGTGG 


AAGAGGTGGT 


TCGAGAGGTG 


TGCTCTGAAC 


AAGCCGAGAC 


GGG6CCGTGC 


900 


CGAGCAATGA 


TCTCCCGCTG 


GTACTTTGAT 


GTGACTGAAG 


GGAAGTGTGC 


CCCATTCTTT 


960 


TACGGCGGAT 


GTGGCGGCAA 


CCGGAACAAC 


CGGAACAACT 


TTGACACAGA 


AGAGTACTGC 


1020 


ATGGCCGTGT 


GTGGCAGCGC 


CATTCCTACA 


ACAGCAGCCA 


GTACCCCTGA 


TGCCGTTGAC 


1080 


AAGTATCTCG 


AGACACCTGG 


GGATGAGAAT 


GAACATGCCC 


ATTTCCAGAA 


AGCCAAAGAG 


1140 


AGGCTTGAGG 


CCAAGCACCG 


AGAGAGAATG 


TCCCAGGTCA 


TGAGAGAATG 


GGAAGAGGCA 


1200 


GAACGTCAAG 


CAAAGIU^CTT 


GCCTAAAGCT 


GATAAGAAGG 


CAGTTATCCA 


GCATTTCCAG 


1260 
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GAGAAA6TG6 


AATCTTTGGA 


ACAGGAAGCA 


GCCAACGAGA 


GACAGCAGCT 


GGTGGAGACA 


1320 


CACATGGCCA 


6AGTGGAAGC 


CATGCTCAAT 


GACCGCCGCC 


GCCTGGCCCT 


GGAGAACTAC 


1380 


ATCACCGCTC 


TGCAGGCTGT 


TCCTCCTCGG 


CCTCGTCACG 


TGTTCAATAT 


GCTAAAGAAG 


1440 


TATGTCCGCG 


CAGAACAGAA 


GGACAGACAG 


CACACCCTAA 


AGCATTTCGA 


GCATGTGCGC 


1500 


ATGGTG6ATC 


CCAAGAAAGC 


CGCTCAGATC 


CGGTCCCAGG 


TTATGACACA 


CCTCCGTGTG 


1560 


ATTTATGAGC 


GCATGAATCA 


GTCTCTCTCC 


CTGCTCTACA 


ACGTGCCTGC 


AGTGGCCGAG 


1620 


GAGATTCAGG 


ATGAAGTTGA 


TGAGCTGCTT 


CAGAAAGAGC 


AAAACTATTC 


AGATGACGTC 


1680 


TTGGCCAACA 


TGATTAGTGA 


ACCAAGGATC 


AGTTACGGAA 


ACGATGCTCT 


CATGCCATCT 


1740 


TTGACCGAAA 


CGAAAACCAC 


CGTGGAGCTC 


CTTCCCGTGA 


ATGGAGAGTT 


CAGCCTGGAC 


1800 


GATCTCCAGC 


CGTGGCATTC 


TTTTGGGGCT 


GACTCTGTGC 


CAGCCAACAC 


AGAAAACGAA 


1860 


GTTGAGCCTG 


TTGATGCCCG 


CCCTGCTGCC 


GACCGAGGAC 


TGACCACTCG 


ACCAGGTTCT 


1920 


GGGTTGACAA 


ATATCAAGAC 


GGAGGAGATC 


TCTGAAGTGA 


AGATGGATGC AGAATTCCGA 


1980 


CATGACTCAG 


GATATGAAGT 


TCATCATCAA 


AAATTGGTGT 


TCTTTGCAGA AGATGTGGGT 


2040 


TCAAACAAAG 


GTGCAATCAT 


TGGACTCATG 


GTGGGCGGTG 


TTGTCATAGC 


GACAGTGATC 


2100 


GTCATCACCT 


TGGTGATGCT 


GAAGAAGAAA 


CAGTACACAT 


CCATTCATCA 


TGGTGTGGTG 


2160 


GAGGTTGACG 


CCGCTGTCAC 


CCCAGAGGAG 


CGCCACCTGT 


CCAAGATGCA 


GCAGAACGGC 


2220 


TACGAAAATC 


CAACCTACAA 


GTTCTTTGAG 


CAGATGCAGA 


ACTAG 




2265 
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INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 aunino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

lie Ala Thr Val lie Xaa lie Thr Leu 
1 5 



wo 92/13069 



PCT/GB92/00123 



52 

INFORHATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

lie Ala Thr Val He Gly He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

He Ala Thr Val He Met He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 9 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

lie Ala Thr Val lie Ala lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

lie Ala Thr Val lie Ser lie Thr Leu 
1 5 
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(2) INFOPMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

He Ala Thr Val He He He Thr Leu 
1 5 



^ 
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INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: 2unino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

lie Ala Thr Val lie Leu lie Thr Leu 
1 5 
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) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

lie Ala Thr Val He Thr He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 anino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 14: 

He Ala Thr Val He Pro He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

He Ala Thr Val He His He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

lie Ala Thr Val lie Cys lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE C3IARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

He Ala Thr Val He Tyr He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

He Ala Thr Val He Phe He Thr Leu 
1 5 



wo 92/13069 



PCT/GB92/00123 



64 

INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

He Ala Thr Val He Glu He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 20; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

lie Ala Thr Val lie Trp lie Thr Leu 
1 5 



wo 92/13069 



PCT/GB92/00123 



66 

INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

He Ala Thr Val He Arg He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

He Ala Thr Val He Asp He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

He Ala Thr Val He Asn He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

lie Ala Thr Val lie Lys lie Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

lie Ala Thr Val lie Gin lie Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO; 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCTCATCCAA ATGTCCCCGT CATT 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCCTAATTCT CTCATAGTCT TAATTCCCAC 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
ATAACCTCAT CCAAATGTCC CC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTAACCCAAG CATCATGGAA GC 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAATGAAATT CTTCTAATTG CG 



wo 92/13069 



PCr/GB92/00123 



76 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CAGACTGAGA TCCCAGAACC CTAGGTCTGA CTCTAGGGTC TTGG 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CAGACTGAGA TCCCAGAACC GATCCTAGGT CTGACTCTAG GGTCTTGG 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CGACCAGGTT GTGGGTTGAC AAATA 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AATCTATTCA TGCACTAGTT TGATACAGC 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ACAGTGATCA TCATCACCTT G 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAAGGTGATG ATGATCACTG T 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
AGCGACAGTG ATCGGCATCA CCTTGGTG 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CACCCAGGTG ATGCCGATCA CTGTCGCT 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ACCCACATCT TGTGCAAAGA ACAC 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTGTTCTTTG CACAAGATGT GGGT 
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(2) INFOKMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCAGCCATC ATGCTGCCCG GGTTGGC 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GCCAACCCGG GCAGCATGAT GACTGGGATC TC 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACCTGCCACT ATACTGGAAT A 



wo 92/13069 



PCT/GB92/00123 



89 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TGTGCATGTT CAGTCTGCCA C 
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wpfrT Ts r.r.AT\ffiD IS; 

1. An isolated polynucleotide comprising a nucleic acid sequence 
encoding a codon 717 mutant of human amyloid precursor protein 770 (APP770), 

5 or an isoform or ftagment of APP770 having a mutant amino acid residue at the 
position encoded by codon 717. 

2. An isolated polynucleotide of claim 1, wherein the amino acid 
at the position encoded by codon 717 is an isoleucine, glycine, or phenylalanine. 

10 

3. An isolated polynucleotide of claim 1, wherein the nucleic acid 
sequence is a cDNA. 

4. A composition comprising a polynucleotide probe capable of 
15 specifically hybridizing to an amyloid precursor protein 770 (APP770) allele 

echibiting a mutation at codon 717. 

5. A composition of claim 4, wherein codon 717 of the mutant 
allele encodes an isoleucine, phoiylalanine, or glycine. 



20 



25 



6. A composition of claim 4, wherein the probe is labeled. 

7. A composition of claim 4, wherein the probe comprises at 
least about 10 nucleotides spanning amino acid 717 of the APP770 allele. 

8. A transgenic host comprising a nucleic acid segment 
encoding a position 717 mutant of human amyloid precursor protein 770 
(APP770), or an APP770 isoform or fragment of APP770 having the mutation. 
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9. A host of claim 8, which is a primary or immortalized 
eukaryotic cell line. 

10. A host of claim 8, which is a bacterium. 

5 

11. A host of claim 8, wherein the segment is integrated into the 

host genome. 

12. A host of claim 8, which is a non-human animal having the 
10 DNA segment incorporated into its germline and which is capable of expressing 

the mutant APP770 protein. 



13. A host of claim 12, wherein the mutant APP770 protein is 
the sole APP770 protein produced by the animal. 

15 

14. A transgenic non-human animal with germ cells or somatic 
cells comprising a heterologous gene encoding a position 717 mutant amyloid 
precursor protein 770 (APP770), which gene upon expression promotes 
neuropathological characteristics of Alzheimer's disease in the animal. 

20 

15. A cultured human primary or immortalized cell, comprising 
a nucleic acid segment encoding a position 717 mutant of human amyloid precursor 
protein 770 (APP770), or an APP770 isoform or fragment of APP770 having the 
mutation. 

25 

16. A method of screening for an agent capable of treating 
Alzheimer's disease, comprising: 

contacting a host of claim 8 with the agent; and monitoring 
expression or processing of proteins encoded by the mutant APP770 gene. 
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17. A diagnostic method for determining an inherited 
predisposition to Alzheimer's disease in a subject, comprising detecting in the 
subject the presence of an allele of amyloid precursor protein (APP), an isoform 
or fragment thereof, wherein said allele has a sequence polymorphism at a position 

5 corresponding to codon 717 of APP770. 

18. A method of claim 17, wherein said sequence polymorphism 
is a nucleotide substitution, whereby an isoleucine or glycine is substituted at 
codon 717 of APP770. 

10 

19. A method of claim 17, wherein said sequence polymorphism 
is a single nucleotide substitution. 

20. A method according to claim 17, wherein the detecting step 
15 comprises sequencing a genomic DNA segment from chromosome 21 of the 

subject. 

21. A method according to claim 17, wherein the detecting step 
comprises (i) mixing a nucleic acid sample from the subject with one or more 

2 0 polynucleotide probes capable of hybridiang selectively to an APP gene allele in 
a reaction and (ii) monitoring the reaction to determine the presence of the gene 
allele in the sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease. 



25 



22. A method according to claim 20, wherein one probe is a 
polynucleotide comprising a sequence of at least about 10 nucleotides spanning 
codon717ofan APP770. 
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23. A method according to claim 22, wherein the probes are 
oligonucleotides capable of priming polynucleotide synthesis in a polymerase chain 
reaction, wherein a reaction product comprises a sequence of at least 25 contiguous 
nucleotides from exon 17 of the APP gene. 

5 

24. A method according to claim 21, wherein at least one 
oligonucleotide specifically hybridizes to a sequence present in an intron or 
flanking region of an APP770 gene. 

10 25. A method according to claim 21 , wherein the monitoring step 

comprises analyzing sequencing gel reaction products from the PGR reaction. 

26. A method according to claim 21 , wherein the monitoring step 
comprises analyzing an autoradiograph of a Bell digest of reaction products from 

15 the PGR reaction. 

27. A method according to claim 17, wherein the detecting step 
comprises (i) mixing in an immunological assay an APP770 or isoform protein 
sample from the subject with an antibody reagent specific for the allele and 

20 (ii) monitoring the assay to determine specific binding between the antibody 
reagent and the protein sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease, 

28. A method according to claim 27, wherein the antibody 
2 5 reagent is a monoclonal antibody specifically reactive with an antigenic determinant 

specific for an allele, 

29. A method for genetic analysis of a human subject which 
comprises detecting the presence or absence of at least one polymorphism at codon 

30 717 of an APP770 gene of an amyloid precursor protein (APP) gene in the subject. 
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30. A method according to claim 29, wherein the polymorphism 
is detected by digesting genomic DNA from the subject with at least one restriction 
endonuclease and hybridizing resulting fragments with a detecting probe 

5 31. A composition comprising a polypq)tide free from human 

proteins, comprising a core sequence: 

ne-Ala-Thr-Val-Ile-X-Ile-Thr-Leu- [SEQ ID N0:6] 
wherein X is any of the twenty conventional amino acids except valine. 

10 32. A transgenic nonhuman animal containing a polypeptide of 

claim 31. 

33. A transgenic nonhuman animal of Claim 32, wherein the 
polypeptide is preset in the brain. 

15 

34. An isolated polynucleotide, comprising a nucleic acid 
sequence encoding a mutant human APP allele that cosegregates with a genetic 
predisposition to Al^eimer's disease. 

20 35. An isolated polyp^tide of Qaim 34, wherein said mutant 

human APP allele comprises a codon 717 mutant. 

36. A method of determining a genetic predisposition of a subject 
to Alzheimer's disease, the method comprising detectmg in the subject's DNA the 

25 presence of an allele of a gene raicoding amyloid precursor protein (APP). 

37. A method as claimed in claim 36, wherein the step of 
detection is carried out on material removed from, and not returned to, the 
subject's body. 
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38. A method as claimed in claim 36 or 37, wherein the allele 
of the gene encodes a substitution mutant of APP. 

39. A method as claimed in claim 38, wherein a single amino 
5 acid is substituted for another. 
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Acoorilag to iBtcraatfo&al Pstcnt Oasstfiettios QPQ or to both NatioBil Gassificition nai IPC 

Int.Cl. 5 C12N15/00; C07K13/00; C12Q1/68; 

C12N5/10; C12N1/21 



C12N15/12 



n. FIELDS SEAKCHED 



Mlalmiim OecaantitlsD Smnbtf 



Cluslflcstisa Syttai 



dissiflotlM Syabels 



Int.Cl. 5 



C12N ; 



C07K ; 



C12q 



DoGUB«Btatio& Starcfatd otbcr thin MinlmuiB DooincBtitioa 
to th« Extc&t that such Dooizatnts art I&dad«d in the Flttds S«afch«i' 



m. DOCUMENTS CONSIDERED TO BE RELEVANT* 



Category •= 



OtatlOD of Dociuient, with indlcatton, when appiopriata, of th« rdtvut passagts ^ 



Rtltvant to Claim No.^ 



NATURE 

vol. 349, no. 6311, 21 February 1991, LONDON GB 
pages 704 - 706; 

GOATE, A. ET AL: 'Segregation of a ralssense 
mutation In the amyloid precursor protein gene 
with familial Alzheimer's disease' 
see the whole document 

SCIENTIFIC AMERICAN. 

vol. 265, no. 5, November 1991, NEW YORK US 
pages 40 - 47; 

SELKOE, D.: 'Amyloid protein and Alzheimer 
disease' 

see page 43 - page 44 



1-7, 

17-31, 

34-39 



1-4, 
31-35 



o Spodal catcgorks of dtH ioammU t 

'A' riocniacntriefiiUDgthtMMnJstattofthf artvifatcfaiSBot 
coasUcrH to b« of parttailar rricviaca 
mUff 4oana«it but pabUsbod 00 or afttr th< iBtanuttonal 
fillog^ato 

IT docmatfit which Bay throw 4oobCs oo priority cUiiB<s) or 
which is dt*4 to establish tho pubUcatfoD datt of aaothtf 
dtatioB or other spedal reason (as specified) 

'O' docomcnt referriag to an onl dlsdosnre, usi^ cxhIUtieB or 
other means 

^P* iocamnt poblished prior to the btvaationai fUing date but 
later than the priority date claimed 



later docnment published afler the Intcmattonal filing datt 
or priority date and not in conflict with the epeUcatioa but 
dtH to understand the prindpte or theory underlying the 



'X' document of paiticBlar relevance; the claimed invention 
cannot be considered novel or cannot be constderad to 
Involve an inventive step 

'V docnm«it of paitlcnIarTelevanee; the claimed Invention 
cannot be considered to involve an inventive step when the 
document U combined with one or more other SUA dpcn-^ 
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